Systematic Study of the Boundary Composition in 



Poisson Boltzmann Calculations 

Parimal Kar^, Yanjie Wei^, Ulrich H. E. Hansmann^'^ 

Siegfried Hofinger^* 



^Department of Physics, Michigan Technological University, 1400 Townsend Drive, Houghton, 
MI, 49331 - 1295, USA 

^John V. Neumann Institute for Computing, Forschungszentrum Jiilich, 52425 Jiilich, Ger- 
many 



* Corresponding Author: 

Siegfried Hofinger 

Michigan Technological University 

Department of Physics 

1400 Townsend Drive, Houghton, Michigan, 49931 - 1295, USA 
Phone: 1-906-487-1496 
Fax: 1-906-487-2933 
Email: shoefing@mtu.edu 

Keywords: Solvation, Continuum Electrostatics, Polarizable Continuum Model, Poisson 
Boltzmann, Implicit Solvation Models 



Abbreviations: DNA, desoxyribosenucleic acid; RNA, ribonucleic acid; PB, Poisson Boltzmann; GB, 
Generalized Born; FDPB, finite difference Poisson Boltzmann; PB/BEM, boundary element method 
Poisson Boltzmann; SASA, solvent accessible surface area; PCM, polarizable continuum model; PDB, 
protein data bank; PDB-REPRDB, representative protein chains from PDB; MSROLL, molecular surface 
program by M. Connolly; SIMS, molecular surface program by Y. Vorobjev; MSMS, molecular surface 
program by M. Sanner; MOLDEN, a molecular graphics program; AMBER, a molecular modelling package 
for biomolecular simulation; DFT, density functional theory; GAUSSIAN-03, a quantum chemical software 
package; GG,AA,VV, di-glycine, di-alanine, di-valine; 



Abstract 



We describe a three-stage procedure to analyze the dependence of Poisson Boltzmann 
calculations on the shape, size and geometry of the boundary between solute and sol- 
vent, our study is carried out within the boundary element formalism, but our results 
are also of interest to finite difference techniques of Poisson Boltzmann calculations. At 
first, we identify the critical size of the geometrical elements for discretizing the bound- 
ary, and thus the necessary resolution required to establish numerical convergence. In 
the following two steps we perform reference calculations on a set of dipeptides in dif- 
ferent conformations using the Polarizable Continuum Model and a high-level Density 
Functional as well as a high-quality basis set. Afterwards, we propose a mechanism 
for defining appropriate boundary geometries. Finally, we compare the classic Poisson 
Boltzmann description with the Quantum Chemical description, and aim at finding 
appropriate fitting parameters to get a close match to the reference data. Surprisingly, 
when using default AMBER partial charges and the rigorous geometric parameters 
derived in the initial two stages, no scaling of the partial charges is necessary and the 
best fit against the reference set is obtained automatically. 
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1 Introduction 



A common way of describing solvation effects to biomolecular structure is to treat the solvent 
as a continuum of characteristic dielectric constant. The biomolecule of interest, i.e. a 
protein, DNA, RNA, glycolipid, etc. is considered in full atomic detail, while the surrounding 
medium is represented as structureless continuum interacting primarily via polarization. 



dispersion, repulsion and cavitation effects 



5|). The underlying physics concerned 



with polarization is then often exp ressed in terms of solutions to the Poisson-Boltzmann 
equation (PB) fl H, sl, 3, 
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13l ). Approximations to the PB — motivated by 



simplified computational protocols — are standard practice e.g. the Generalized Born model 
(GB) jw, lisl ). However, PB and GB are dealing with the polarization term only, and the 
other above mentioned interactions are usually treated by either first-principle (jl6l ) or semi- 
empirical (jirl ) character. 

Solutions to the PB are computed either by the finite difference method (FDPB) 
7, 8, ^ or by the boundary element method (PB/BEM) (12, 13). The latter is particularly 
intriguing since it reduces a three-dimensional integral over the entire volume to a two- 
dimensional surface integral, leading to considerable savings in computational time. Both 
approaches depend fundamentally on the exact definition of the boundary between solute 
and solvent. All definitions are based on the area of the atoms exposed to the solvent, 
for instance the solvent accessible surface area (SASA), the solvent excluded volume, or the 
molecular surface (jlSl ) , which all depend on a chosen set of van der Waals radii (Q, flQ, Q 
assigned to the center of the atoms. 

Given the dependence on the exact geometry and quality of the boundary it appears 
necessary to study the geometric factors that influence the outcome of PB calculations in 
greater detail. This is particularly appropriate for semi-quantitative approaches (1231 ) where 
the demand on accuracy is a very sensitive issue (1241 ). Particular attention has to be drawn 
to factors such as i) surface type and surface resolution, ii) dependence on atomic model 
parameters, i.e. van der Waals radii, iii) generality and physicochemical significance. In this 
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present work we provide such an analysis by focussing on each of these three points separately. 
At first, we employ different surface generation algorithms to a subset of randomly chosen 
protein structures of variable size and shape. PB/BEM calculations are carried out with 
increasing resolution of the boundary. Optimal surface resolution and surface generation 
parameters that guarantee numerical convergence and methodic stability are derived. Next, 
we use these optimized parameters for a set of model peptides and vary the van der Waals 
radii in a systematic way. The reference set of model peptides is considered at a hig h level of 
quantum chemical theory, i.e. PCM (0) using the Becke-98 density functional (1251 ) and the 
basis set of Sadlej (1261 ) . The aim of this second step is to identify optimal van der Waals radii 
within the PB/BEM approach that will lead to boundaries and solute geometries of similar 
size and shape as those used in the high-level PCM calculations. Finally, with the optimized 
parameters determined in the initial two stages we compute actual PB/BEM polarization 
energies in order to obtain a close match with the quantum chemical results obtained from 
the reference set. 



2 Methods 

2.1 Sample Selection, Preparation and Set Up of Structures and 
Computation of Molecular Surfaces with Different Programs 



The actual download site used is the repository PDB-REPRDB (|28| ). Default options are 
applied with the following exceptions: i) Number of residues less than 40 excluded 
- NO, ii) Include MUTANT - NO, iii) Exclude COMPLEX, iv) Exclude FRAG- 
MENT, v) Include NMR - NO, vi) Include Membrane Proteins - NO. A total of 
28 structures of different protein sizes and shapes (see table [I]) are chosen. The PDB codes of 
the samples are, 2ERL, 1P9GA, 1FD3A, 1N13E, IBRF, IPARB, 1K6U, lAVOA, ISCMA, 
lOTFA, IDJTA, 1KU5, 1K3BC, 1R2M, 1CC8, 1L9LA, IZXTD, IGYJA, 1T8K, IXMK, 
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lYNRB, lEZGA, 1C5E, ISAU, 1WN2, IJBE, 1C7K and IWKR. 

Four different programs to calculate molecular surfaces have been employed: the 
Connolly program MSROLL (llSl). the SIMS program (l29l ). the MSMS program by Sanner 
( l30l ) and a self-written program based on an estimation of the SASA (31). However, we 
detected already in early stages of this investigations problems with varying van der Waals 
radii and found indications that the SASA is not an appropriate choice for this kind of 
application. Hence, we have dropped the latter two programs, and most of the analysis is 
done with programs MSROLL and SIMS. 

Downloaded PDB structures are cleaned from multichain entries, HETATM lines 
CONNECT lines, ANISOU lines, counter ions, water molecules and the footer section. Pro- 



gram MOLDEN (1321 ) is used to visualize the downloaded PDB structures after cleaning and 
the force field Tinker Amber is selected before a new PDB file is written out from within 
MOLDEN using option 'Write_With_Hydrogens'. Since MOLDEN always uses the default 
HIP-type in AMBER jargon, HIS residues need to be converted to HIP types, as well as CYS 
residues engaged in disulfide bonds need to be converted to CYX-type residues. Occasional 
cases with PRO being the initial residue are manually edited and initial PROs removed. 
AMBER non-bonded parameters (l33l). i.e. charges and van der Waals radii are assigned to 
all the atoms in the protein structures. In this first part of the study, the vdW-radii are 
increased by a factor of 1.12 and atomic partial charges are scaled down by another factor 

of 0.9 y. 

The MSROLL program is used with varying choices of the fineness value (the -f 
command line argument) which defines the resolution of the surface. With smaller values 
the resolution of the surface becomes better but computational cost will increase. The 
probe radius (the -p command line argument) is set to 1.5 A. Analytically calculated SASA 
and molecular volumes are recorded, and the data file containing triangulation details is 
translated into a human readable format, and critical items (for example almost coinciding 
triangles) removed. 

The SIMS program is used with identical arguments to those employed in MSROLL. 
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Similarly, varying the resolution of the surface triangulation into small sized triangles means 
adjusting the dot-density parameter in SIMS. Higher values for this parameter will yield 
higher surface resolutions but also increase the computational demand. We record the num- 
ber of BE, number of iterations, SASA and volume for comparison. 

2.2 Computation of Polarization Free Energies, AG^°\ Based on 
solutions to the Poisson Boltzmann Equation 

Inner/outer dielectric constants at the molecular boundary are set to 1.0 and 80.0 respec- 



tively. The serial version of the PB/BEM program POLCH ( 1351 ) is used. Critical cases 
with additional secondary cavities located in the ijiterior part of the proteins are excluded. 
AMBER van der Waals radii and partial charges (1 331 ) are applied. Using our own tool chain 



for the assignment allows us to conveniently scale these data, as well as to write out in the 
same instance the corresponding parameter files required by the molecular surface programs. 

2.3 Density Functional Theory Calculations Including the Polar- 
izable Continuum Model on a Reference Set of Dipeptides 



The most prominent combinations of peptidic $, \l/-angles (1361 ) are used to construct different 
conformations of dipeptides. Only homodimers are considered. All 20 types of different 
amino acids are used for this combinatorial approach. Zwitter-ionic forms are built and 
9 conformations per class of amino acid are taken into account leading to all in all 180 
structures. Program "protein. x" from the TINKER package version 4.2 is employed 



Each of these reference structures is subjected to PCM (3j calculations at the Becke-98 



( 1251 ) level of density functional theory (DFT) using the high-quality basis set of Sadlej (1261 ) 



within the Gaussian-03 suite of programs (1381 ). Geometric properties, i.e. the molecular 



volume and the molecular surface well as polarization free energies are extracted 

from the reference calculations and used as a base line when comparing to PB/BEM data. 
The computational demand of these reference calculations is significant. For example, WW- 
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conformations require on the order of 6 weeks (and beyond) single-processor time on modern 
computing architectures. 



3 Results 

3.1 Stage I: Rather small-sized BEs are Needed to Obtain Con- 
sistently Convergent Polarization Free Energies AG^"^ 

We start with PB/BEMA calculations for a set of protein structures (PDB codes summa- 
rized in Table [T]) . The boundary discretization is achieved with two independent programs, 



MSROLL (jl8l ) and SIMS (1291 ). Boundary resolution into BEs is steadily increased with ei- 
ther program and independent PB/BEM results are computed for each particular boundary 
decomposition. A typical plot of the trend of AG^"'' as a function of number of BEs is 
shown in Figure [T]for the protein structure with PDB code 1C5E. Similar plots for the other 
examples in Table [T] are provided as supplementary material. Both approaches converge 
to identical results in the limit of large numbers of BEs. The importance of well-resolved 
boundaries becomes clear from Figure [H Errors on the order of ±40^^ are easily introduced 
when working in the non-converged domain. Connolly's MSROLL program (red triangles 
in Figure [1]) reaches a plateau value in a continuous manner, while the SIMS program (blue 
spheres in Figure [1]) finds its limit value within an alternating sequence. The SIMS program 
reaches convergence much faster than the Connolly program. The quality of the computed 
molecular boundaries is comparable, see, for instance, the values of molecular surfaces and 
volumes (final two columns in Table [11) obtained with either program. SIMS seems to overes- 
timate the volume by a small margin of roughly 1%. The recommended average size of BEs 
for converged results using MSROLL is on the order of 0.11 while SIMS would require an 
average size of 0.31 A^. Both numbers are close to the value of 0.4 A^ advocated in Quantum 



Chemistry (1391 ). 
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3.2 Stage II: Systematic Geometric Comparison to High Level 
Quantum Chemistry Calculations Suggests a Uniform Scaling 
of AMBER van der Waals Radii by a Factor of 1.07 

A reference set of dipeptides in different conformations (9 per species) is constructed. Only 
homodipeptides comprising all 20 types of naturally occurring amino acids are considered. 
Thus a total number of 180 dipeptidic reference structures is set up. The zwitterionic form 
is used throughout. Each of these structures is computed at the Becke-98 level of theory 



(1251 ) using the basis set of Sadlej (1261 ) and the PCM model (l3|) for solvation free energies. 
Geometric properties such as the cavity volume and the cavity surface area are extracted 
from each of the reference calculations. All 180 structures are also computed within the 
PB/BEM approach using optimized parameters for the boundary resolution determined in 
Stage I of this study. However, only the SIMS program is used. We define a global deviation 
from the reference data by 

1 20 1 9 I 

A""^' = ^ E ^ E JiSurf!:f^ - Surf^'^l^r'^f (1) 

where j runs over the conformations and i over the different types of homodipeptides, i.e. 
GG, AA, VV, etc. The parameter a refers to a specific scaling factor used when constructing 
the boundaries within the PB/BEM approach. In particular this scaling makes the van der 
Waals radii larger or smaller by a certain fraction. The AMBER default set of van der Waals 



radii is used (1331 ) . A similar criterion is used for comparing molecular volumes, 

1 20 1 9 I 

A"^"' = ^ E ^ E ^ivoif^^^ - voirfr'y (2) 

and the dependence on the scaling factor a is shown in Figures [2] and [31 As becomes 
clear from Figures [2] and [3] the best match to the reference data is obtained when scaling the 
AMBER van der Waals radii by a factor of 1.07. Detailed data with respect to conformational 
averages per type of dipeptide are shown in Tables [III and IIIII 
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3.3 Stage III: Charge Scaling is Not Required 



Using the optimized parameters obtained in the previous two stages leads us to the final 
step of directly comparing polarization free energies AG^"' computed within the PB/BEM 
approximation and at the PCM level of theory. The idea is to identify another uniform scaling 
factor (3 which applied to the AMBER default charges would result in an optimal match to 
the reference polarization free energies. Thus another deviation criterion is introduced, 

that allows to identify the optimal value of p. The dependence of the PB/BEM polarization 
free energies on the charge scaling factor (3 is shown in Figure |H The trend shown in Figure 
m suggests an optimal value of (3 very close to 1.0, hence no charge scaling is required. This 
result i) emphasizes the broad applicability of AMBER partial charges and ii) circumvents 
conceptual difficulties that would arise when charges had to be scaled, i.e. modified net 
charges in proteins, non-neutral forms, etc. A detailed analysis with respect to the magnitude 
of the average deviation of each particular type of dipeptide studied is shown in Table IIVI 



Discussion 



Motivated by our recent high-performance solution to Poisson Boltzmann calculations (l35l ) 
we have now tested the infiuence of the many critical parameters involved. One obvious 
issue is the exact choice and composition of the boundary between solute and solvent. At 
first, we have to ensure the numerical stability within the selected level of approximation. In 
order to address this problem we have carried out PB/BEM calculations on a large sample 
of different proteins. When using different programs to create the boundary surface and in- 
creasing systematically the resolution of these surfaces into small-sized boundary elements, 
a recommended threshold size of about 0.31 A^ for the average BE is identified when using 



program SIMS (1291 ) which showed faster convergence than the well-known Connolly program 



(jlSl ). Although giving rise to very fine- resolved boundary surfaces, hence large numbers of 
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BEs, this value is close to the corresponding value of 0.4 frequently advised in Quantum 
Chemical models (1391 ) . As a consequence, even proteins of modest size thus require consid- 
eration of vast numbers of BEs (see for example Table [T]), and the importance of efficient 
means of solving the computational problem is underlined again. 

After having established the necessary degree of boundary partitioning in the first 
stage, we performed a systematic comparison against a reference set of dipeptides computed 
at a high level of Quantum Chemical theory. Consideration of geometric factors revealed 
that when applying a scaling factor of about 1.07 to AMBER default van der Waals radii, 
rather good agreement can be reached between the reference geometries and the geometries 
in the PB/BEM approach. The recommended value of 1.07 is somewhat smaller than a factor 
found previously (1.12 of ref (1341)) and reflects the much finer resolved boundary surfaces 
used in this present work. 

The final step was to compare actual calculations of the polarization free energies 
to each other. Following previous attempts, we wanted to derive another scaling factor 
that, when applied to AMBER partial charges, would yield a close match to the reference 
polarization free energies. The trend visible in Figure H] indicates that no scaling of the 
charges is necessary: they are already close to optimal. This is an unexpected — but very 
welcome — result, as it eliminates potential secondary problems that would emerge with 
modifying charges. Again, this is another consequence of the much finer resolved boundary 
surfaces in this present work as opposed to previous results (1341 ) where a scaling factor of 
0.9 had been found. 



5 Conclusion 

Combined employment of small-sized BEs (~ 0.3 on average), slightly increased AMBER 
van der Waals radii (by a factor of 1.07), and default AMBER partial charges leads to good 
quality estimates of the polarization free energy, AG^"', for proteins within the PB/BEM 
framework. 
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Table I: PDB codes of studied structures and corresponding number of residues. Columns 
three and four show the number of BEs needed to reach converged PB/BEM results using 
molecular surface algorithms MSROLL and SIMS respectively. Final two columns show 
average molecular surface areas and average molecular volumes derived when using data of 
both programs, MSROLL and SIMS. The differences obtained when subtracting the SIMS 
results from the MSROLL results are given in parentheses. 
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39316 


13659 


3244 (-1) 


9596 


(-74 


1CC8 


73 


27668 


15091 


3644 


(+2) 


11094 


(-65 


1L9LA 


74 


20278 


11636 


4182 


(+5) 


11728 ( 


-112 


IZXTD 


76 


37335 


11259 


4089 


(+8) 


10809 


(-93 


IGYJA 


76 


44770 


13665 


488^ 


5 (-3) 


11464 ( 


-118 


1T8K 


77 


35978 


13846 


3925 


(+4) 


11410 ( 


-119 


IXMK 


79 


56033 


16468 


4294 


(+9) 


12288 


(-98 


lYNRB 


80 


31529 


12630 


4417 ( 


+16) 


11911 ( 


-157 


lEZGA 


84 


34628 


9122 


3258 


(+3) 


10103 


(-95 


1C5E 


95 


48306 


19880 


448 


0(0) 


13285 ( 


-110 


ISAU 


115 


47613 


17765 


5197 ( 


+25) 


17897 ( 


-116 


1WN2 


121 


51325 


21555 


5614 ( 


+ 15) 


17836 ( 


-118 


IJBE 


128 


58119 


16729 


5409 (+22) 


18905 ( 


-188 


1C7K 


132 


54104 


16675 


538 


9 (0) 


18858 ( 


-182 


IWKR 


340 


74167 


55378 


11008 


(-41) 


47105 ( 


-299 



Table II: Comparison of average molecular surfaces based on unsealed and scaled AMBER 

van der Waals radii with data from PCM reference calculations. The average comprises 9 

different conformations per type of dipeptide considered. The value in parentheses monitors 

that conformation that deviates most severely from the mean. 

Dipeptide Mean Surface Mean Surface Mean Surface 

lype AMBER Unsealed AMBER Scaled PCM Reference 





r 9 On 




r 9 On 




r 9 On 


AA 


191.764 (5 


205) 


204.388 (3 


697) 


214.747 (4.955) 


CC 


214.592 (4 


622) 


229.274 (5 


461) 


232.910 (6.522) 


DD 


228.687 (6 


032) 


242.724 (5 


972) 


240.839 (6.776) 


EE 


273.402 (5 


835) 


287.557 (6 


951) 


283.025 (5.766) 


GG 


150.255 (4 


460) 


161.637 (4 


936) 


167.764 (3.249) 


II 


279.535 (10 


274) 


294.402 (11 


331) 


302.173 (12.233) 


KK 


314.712 (6 


273) 


332.593 (7 


677) 


340.545 (7.599) 


LL 


276.435 (10 


012) 


290.497 (12 


440) 


293.172 (10.465) 


MM 


301.384 (6 


691) 


318.033 (8 


261) 


329.815 (8.374) 


NN 


232.466 (6 


072) 


247.553 (7 


325) 


247.040 (7.310) 


QQ 


276.939 (5 


957) 


293.663 (7 


531) 


292.648 (6.582) 


RR 


354.636 (6 


925) 


377.310 (7 


568) 


380.408 (6.739) 


SS 


196.528 (4 


982) 


207.904 (4 


610) 


212.107 (4.695) 


TT 


224.181 (8 


047) 


238.570 (8 


359) 


239.112 (10.124) 


W 


251.913 (8 


574) 


265.008 (8 


296) 


276.423 (8.140) 


YY 


340.272 (17 


100) 


356.042 (17 


293) 


346.378 (14.704) 


FF 


329.058 (17 


123) 


343.245 (17 


402) 


326.947 (15.489) 


WW 


355.790 (26 


425) 


377.209 (27 


865) 


361.573 (25.182) 


HH 


282.802 (13 


007) 


299.235 (12 


829) 


296.885 (11.461) 


PP 


224.999 (10 


768) 


237.525 (10 


500) 


233.118 (9.900) 



Table III: Comparison of average molecular volumes based on unsealed and scaled AMBER 

van der Waals radii with data from PCM reference calculations. The average comprises 9 

different conformations per type of dipeptide considered. The value in parentheses monitors 

that conformation that deviates most severely from the mean. 

Dipeptide Mean Volume Mean Volume Mean Volume 

lype AMBER Unsealed AMBER Scaled PCM Reference 





r 9 On 

[A3] 






r 9 O 






r 9 O 

[A3 






AA 


191.804 


(4 


553) 


215.661 


(3 


355) 


228.591 


(2 


885) 


CC 


223.713 


(3 


799) 


252.135 


(4 


069) 


255.902 


[3 


594) 


DD 


242.406 


(4 


242) 


270.928 


(5 


037) 


260.904 


[4 


723) 


EE 


296.037 


(4 


888) 


327.303 


(3 


864) 


311.645 


(3 


810) 


FF 


381.346 


(5 


875) 


417.696 


(7 


992) 


388.363 


(6 


223) 


GG 


136.315 


(3 


565) 


154.314 


(2 


631) 


164.287 


(2 


048) 


HH 


317.309 


(3 


605) 


353.465 


(5 


228) 


340.591 


[5 


324) 


11 


323.590 


(6 


689) 


357.080 


(7 


564) 


367.182 


[6 


520) 


KK 


343.720 


(3 


747) 


382.391 


(4 


384) 


388.641 


(3 


903) 


LL 


313.537 


(5 


063) 


346.507 


(6 


450) 


344.950 


(7 


458) 


MM 


325.073 


(5 


069) 


363.563 


(5 


484) 


377.789 


(3 


905) 


NN 


248.729 


(4 


628) 


278.449 


(4 


968) 


271.305 


[5 


415) 


PP 


242.861 


(8 


154) 


269.561 


(9 


116) 


263.128 


[9 


957) 


QQ 


301.279 


(3 


985) 


336.468 


(5 


590) 


325.660 


(5 


591) 


RR 


384.833 


(4 


089) 


431.811 


(4 


444) 


424.874 


(3 


559) 


SS 


198.981 


(3 


533) 


221.590 


(3 


162) 


225.193 


(2 


964) 


TT 


244.397 


(5 


673) 


272.848 


(7 


051) 


273.546 


[6 


533) 


VV 


282.296 


(6 


114) 


311.383 


(6 


895) 


330.378 


(8 


172) 


WW 


431.248 ( 


14 


910) 


480.974 (17 


118) 


447.693 (15 


019) 


YY 


393.622 (5 


433) 


433.845 (7 


433) 


407.695 (5 


466) 



Table IV: Comparison of average PB/BEM polarization free energies AG^"' using AMBER 
default charges to corresponding data obtained from PCM reference calculations. The av- 
erage comprises a variable number of conformations for each different type of dipeptide 
considered depending on how many PCM calculations terminated faithfully. Numbers in 
parentheses reflect maximum deviation from the mean. 



Dipeptide 


Mean AG^°''^^/^^^ 


Mean AG^"^'^^^ 


Mean AAG^"' 


Number of 


Type 


AMBER Default Charges 


PCM Reference 


Deviation 


References 




[kcal/mol] 


[kcal/mol] 


[kcal/mol] 




AA 


-91.36 ( 8.56 ) 


-83.89 (10.12 ) 


7.47 


9 


CC 


-115.11 (11.02 ) 


-96.80 (12.83 ) 


18.31 


9 


DD 


-296.25 (17.08 ) 


-285.27 (18.24 ) 


10.98 


9 


EE 


-266.54 (14.09 ) 


-259.29 (13.76 ) 


7.25 


9 


GG 


-96.52 (10.09 ) 


-89.41 (11.36 ) 


7.11 


9 


II 


-82.72 ( 7.49 ) 


-75.97 ( 8.70 ) 


6.77 


9 


KK 


-249.63 (16.51 ) 


-236.37 (19.64 ) 


13.26 


9 


LL 


-85.54 ( 7.20 ) 


-64.51 ( 8.64 ) 


21.03 


9 


MM 


-88.82 ( 7.55 ) 


-82.10 ( 9.42 ) 


6.72 


9 


NN 


-105.11 ( 8.19 ) 


-101.80 (12.20 ) 


4.25 


9 


QQ 


-119.08 (10.88 ) 


-115.33 (12.60 ) 


3.89 


9 


RR 


-235.39 (17.79 ) 


-228.71 (21.45 ) 


6.68 


6 


SS 


-112.78 (13.38 ) 


-105.47 (13.90 ) 


7.32 


9 


TT 


-106.87 (12.06 ) 


-100.61 (12.88 ) 


6.55 


9 


VV 


-85.17 ( 7.44 ) 


-77.46 ( 8.73 ) 


7.70 


9 


YY 


-93.35 ( 4.36 ) 


-90.11 ( 8.48 ) 


3.59 


5 


FF 


-89.92 (10.55 ) 


-82.51 (13.94 ) 


7.41 


6 


HH 


-237.74 (19.08 ) 


-236.06 (22.15 ) 


3.66 


9 


PP 


-79.15 ( 5.73 ) 


-82.71 ( 7.50 ) 


3.56 


9 


WW 


-100.50 ( 4.27 ) 


-88.11 (12.93 ) 


12.39 


2 
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